Cogflorence 2 Large Freeze
MIT
This is a fine-tuned version of the microsoft/Florence-2-large model, trained on a subset of 38,000 images from the Ejafa/ye-pop dataset, using CogVLM2-generated annotations, focusing on image-to-text tasks.
Image-to-Text
Transformers Supports Multiple Languages